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METHODS FOR mENTIFYlNG MODULATORS OF 
QUADRUPLEX NUCLEIC ACIDS 

Field of the Invention 

[00011 The invention concerns methods for identifying molecules that modulate a biological 
activity of a nucleic acid capable of forming secondary structures such as G-quadruplexes. 

Background 

[0002] Developments in molecular biology have led to an understanding of how certain 
therapeutic compounds interact with molecular targets and lead to a modified physiological 
condition. Specificity of therapeutic compounds for their targets is derived in part from 
interactions between complementary structural elements in the target molecule and the 
therapeutic compound. A greater variety of target structural elements in the target leads to the 
possibility of unique and specific target/compound interactions. Because polypeptides are 
structurally diverse, researchers have focused on this class of targets for the design of specific 
therapeutic molecules. 

[0003] In addition to therapeutic compounds that target polypeptides, researchers also have 
identified compounds that target DNA. Some of these compounds are effective anticancer 
agents and have led to significant increases in the survival of cancer patients. Unfortunately, 
however, these DNA targeting compounds do not act specifically on cancer cells and therefore 
are extremely toxic. Their unspecific action may be due to the fact that DNA often requires the 
uniformity of Watson-Crick duplex structures for compactly storing information within the 
human genome. This uniformity of DNA structure does not offer a structurally diverse 
population of DNA molecules that can be specifically targeted. 

[0004] Nevertheless, there are some exceptions to this structural uniformity, as certain DNA 
sequences can form unique secondary structures. For example, intermittent runs of guanines can 
form G-quadruplex structures, and complementary runs of cytosines can form i-motif structures- 
Formation of G-quadruplex and i-motif structures occurs when a particular region of duplex 
DNA transitions from Watson-Crick base pairing to intermolecular and intramolecular single- 
stranded structures. 
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Summary 

[0005] Certain regulatory regions in duplex DNA can transition into single-stranded G- 
quadruplex structures that regulate important biological processes. A gene in proximity to a G- 
quadruplex structure often is not appreciably transcribed into RNA, and certain proteins induce 
transcription and activation of the gene by facilitating the transition of quadruplex structures into 
transcribable structures. G-quadruplexes in intramolecular parallel (IP) conformations now have 
been identified as transcription regulators in certain nucleic acids. This discovery thereby allows 
for the design of candidate molecules capable of modulating the biological activity of nucleic 
acids. 

[0006] There are multiple intramolecular parallel conformations, examples of which are 
shown in Figures 1 and 2. As shown in these drawings, quadruplex structures formed by the 
CAfKCNHE region (nucleotide sequence shown in Table 1) include tetrads joined by loops, and 
the orientation of each strand contributing to the tetrads are predominantly paralleL In the 
conformations depicted in Figure 1, the four strands forming the tetrads are purely parallel (all 
four strands have a 5' to 3' orientation in the same direction). In contrast, the G-quadruplex 
conformations depicted in Figure 3 are purely antiparallel as the strand polarity sequentially 
alternates (/.e., the orientation at positions 7 to 9 is South, the orientation at positions 11 to 13 or 
12 to 14 is North, the orientation at positions 16 to 18 is South, and the orientation at positions 
20 to 22 or 21 to 23 is North). The antiparallel conformations shown in Figure 3 conform to a 
fplded-over hairpin, which often is referred to as a "chair" conformation. Like in Figure 1, 
intramolecular parallel structures are shown Figure 2. Three strands contributing to the tetrads 
of the two G-quadruplexes depicted in the middle of the drawing are parallel and one is 
antiparallel. The G-quadruplex depicted at the top right of Figure 2 also is intramolecular 
parallel, in which two parallel strands non-sequentially alternate with strands of reverse polarity 
(/.e., the orientations at positions 12 to 14 and at positions 16 to 18 are the same). Data suggests 
that the three IF G-quadruplex conformations with mixed strand polarity in Figure 2 (/.e., two 
depicted in the middle and one depicted at the top right) are particularly relevant structures for 
modulating the biological activity of certain nucleic acids by interaction with quadruplex- 
stabilizing compounds. Accordingly, the methods described herein are especially applicable to 
these three quadruplex conformations. 

[0007] Thus, featured herein is a method for identifying a candidate molecule that interacts 
with a nucleic acid capable of forming a G-quadruplex structure, which comprises contacting a 
test molecule with a nucleic acid comprising or consisting of a G-quadruplex in an IP 
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conformation, and determining whether a portion of the test molecule is in proximity to and/or 
interacts with one or more sites in the G-quadruplex. Sites with which the test molecule interact 
often are proximally oriented G-tetrads, and sometimes are within one or more bases in a loop 
region. Multi-functional test molecules sometimes interact with other portions of the nucleic 
acid, such as with a site in the G-quadruplex and a site in a secondary structure formed by an i- 
motif sequence (i.e., a sequence complementary to the G-quadruplex forming sequence) and/or a 
duplex region located between G-quadruplex and i-motif structures. A test molecule having a 
portion in proximity to the site in the G-quadruplex is identified as a candidate molecule that 
modulates the biological activity of the nucleic acid. The method also is useful for identifying a 
candidate molecule that modulates a biological activity of the nucleic acid capable of forming a 
G-quadruplex structure. 

[0008] Also featured are methods for determining whether a candidate molecule modulates a 
biological activity of the nucleic acid comprising an IP quadruplex. In certain embodiments, an 
assay useful for monitoring the IP quadruplex structure is utilized (e.g., chemical footprinting 
and/or circular dichroism assays). In specific embodiments, one or more nucleotides in the IP 
quadruplex are substituted with a fluorescent molecule (e.g., 2-amino adenosine) and structurally 
distinct forms of the labeled nucleic acid are detected under various conditions (e.g., by a gel 
electrophoresis mobility shift assay), or interactions between the labeled nucleic acid and other 
molecules are detected (e.g., by monitoring shifts in fluorescence when test molecules and other 
molecules are added to a sample containing the labeled nucleic acid). 

[0009] In addition, provided are methods for stabilizing an IP G-quadruplex conformation in 
a nucleic acid, which comprises contacting a system with a quadruplex-interacting molecule, 
whereby the molecule stabilizes the nucleic acid in an IP G-quadruplex conformation. The 
quadruplex interacting molecule sometimes selectively stabilizes an IP conformation over a non- 
IP quadruplex conformation, and sometimes selectively stabilizes one IP conformation over 
another IP conformation. In certain embodiments, the quadruplex interacting molecule is one 
set forth in U.S. application no. 10/661,241 filed September 12, 2003, and in other 
embodiments, the molecule is TMPyP4, an expanded porphoryin analog such as Se2SAP, or 
telomestatin, or a derivative thereof. Examples of other quadruplex-interacting molecules are 
described hereafter. Such processes can be performed in a cell-free system or in a system 
having cells (e.g., in a cell culture dish or in an animal or human). Also featured arc methods for 
treating a condition by administering a candidate molecule to a subject in need thereof 
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Brief Description of the Drawings 

[0010] Figure 1 depicts in the top row G-quadruplex IP structural conformations in a nucleic 
acid from the CA/FC NHE region, which are useful for identifying candidate molecules that 
modulate the biological activity of the nucleic acid. All four strands contributing to the G-tetrad 
core have parallel polarities. The IP conformations are distinguished by different nucleosides in 
loop regions (e.g., the thymidine at position 10 and the guanosine at position 1 1 in the third 
structure from the left) and tetrad regions (e.g., the guanosine at position 13 in the second 
structure from the left and the guanosine at position 14 in the third structure from the left). 

[0011] Figure 2 shows G-quadruplex structural conformations in a nucleic acid from the 
CA/yC NHE region stabilized by quadruplex-interacting molecules. The conformation 
designated "parallel" on the left side of the drawing is an IP structure depicted in Figure 1. The 
conformations depicted in the middle top and bottom of the drawing are IP conformations with 
three parallel strands and one antiparallel strand. The conformations in the middle top and 
bottom are distinguished by the orientation of particular strands forming the tetrad core, as well 
as the orientation of loops containing the thymidine at position 10 and the thymidine at position 
19. The conformation depicted on the right side of the drawing is intramolecular antiparallel, in 
which two parallel strands non-sequentially alternate with strands of reverse polarity. 

[0012] Figure 3 depicts intramolecular antiparallel conformations that may form upon 
addition of a quadruplex interacting molecule. The antiparallel conformations shown form a 
folded-over hairpin structure referred to herein as a "chair." 

Detailed Description 

[0013] Featured herein are methods for identifying a candidate molecule that interacts with a 
nucleic acid capable of forming a G-quadruplex IP structure. Such methods are useful for 
identifying molecules that modulate the biological activity of the nucleic acid by narrowing the 
number of molecules screened in biological assays. Such methods also are useful for designing 
molecules that modulate a biological activity of the nucleic acid. The compounds identified by 
the methods described herein are useful for identifying molecular therapeutics that reduce cell 
proliferation and/or microbial {e.g., bacterial or viral) proliferation, and in certain embodiments, 
treat a condition associated with the G-quadruplex in a subject. 
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Nucleic Acids 

I0014J Candidate molecules are identified by contacting a test molecule with a nucleic acid 
and determining whether the test molecule interacts with the nucleic acid and/or modulates a 
biological activity of the nucleic acid. Nucleic acids often comprise or consist of DNA (e.g., 
genomic DNA (gDNA) or complementary DNA (cDNA)) or RNA (e.g., mRNA, tRNA, and 
rRNA). In embodiments where a nucleic acid is a gDNA or cDNA fragment, the fragment often 
is 50 or fewer, 1 00 or fewer, or 200 or fewer base pairs in length, and sometimes is about 300, 
about 400, about 500, about 600, about 700, about 800, about 900, about 1000. about 1 100, about 
1200, about 1300, or about 1400 base pairs in length. In an embodiment, the nucleic acid is 
double-stranded, and is sometimes between about 30 nucleotides to about 40 nucleotides in length. 
Methods for generating gDNA and cDNA fragments are known in the art (e.g., gDNA may be 
fragmented by shearing methods and cDNA fragment libraries are commercially available). In 
embodiments where the nucleic acid is a synthetically prepared fragment nucleic acid, often 
referred to as an "oligonucleotide," the fragment sometimes is less than 30, less than 40, less than 
50, less than 60, less than 70, less than 80, less than 90, or less than 100 nucleotides in length. 
Synthetic oligonucleotides can be synthesized using standard methods and equipment, such as 
by using an ABI™3900 High Throughput DNA Synthesizer, which is available from Applied 
Biosystems (Foster City, CA). 

[0015] Nucleic acids sometimes comprise or consist of analog or derivative nucleic acids, 
such as peptide nucleic acids (PNA) and others exemplified in U.S. Patent Nos, 4,469,863; 
5,536,821; 5,541,306; 5,637,683; 5,637,684; 5,700,922; 5,717,083; 5,719,262; 5,739,308; 
5,773,601; 5,886,165; 5,929,226; 5,977,296; 6,140,482; WIPO publications WO 00/56746 and 
WO 01/14398, and related publications. Methods for synthesizing oligonucleotides comprising 
such analogs or derivatives are disclosed, for example, in the patent publications cited above, in 
U.S. Patent Nos. 5,614,622; 5,739,314; 5,955,599; 5,962,674; 6,1 17,992; and in WO 00/75372. 

[0016] The nucleic acids often include a nucleotide sequence capable of forming a 
secondary structure. Examples of secondary structures are quadruplex structures, which form 
from subsequences rich in purines (e.g., guanines in G-quadruplex structures), and i-motif 
structures, which form fh>m subsequences rich in pyrimidines (e.g., cytosines). Secondary 
structures can exist in different conformations, which differ in strand stoichiometry and/or strand 
orientation. For example, secondary structures sometimes are formed by interstrand 
interactions, in which the interacting strands are in the same direction (i.e., parallel, the 
interacting strands are oriented 5* to 3*) or in different directions (i.e., antiparallel, the 
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interacting strands are oriented 5' to 3' and 3' to 5*), and sometimes are formed by intrastrand 
interactions, examples of the latter of which are described in Figure 1 . Quadruplex structures 
sometimes form because certain purine rich strands are capable of engaging in a slow 
equilibrium between a typical duplex helix structure and both unwound and non-B-form 
substructures. These unwound and non-B forms sometimes are referred to as "paranemic 
structures," and some forms are associated with sensitivity to SI nuclease digestion, which 
sometimes are refened to as "nuclease hypersensitivity elements" or •'NHEs." A quadruplex is 
one type of paranemic structure and certain NHEs can adopt a quadruplex structure. The entire 
length of the nucleic acid sometimes participates in the quadruplex structure, and a portion of the 
nucleic acid length (i.e., a subsequence) often forms a quadruplex structure. 

[0017] The ability of guanine-rich nucleic acids of adopting G-quadruplex conformations is 
due to the formation of guanine tetrads through Hoogsteen hydrogen bonds. One nucleic acid 
sequence can give rise to different quadruplex orientations, where the different conformations 
depend upon conditions under which they form, such as the concentration of potassium ions 
present in the system and the time that the quadruplex is allowed to form. Different quadruplex 
conformations can be distinguished from one another using standard procedures such as 
chemical footprinting and circular dichroism analyses (see e.g., U.S. application no. 10/407,449 
filed April 4, 2003). Also, multiple conformations can be in equilibrium with one another, and 
can be in equilibrium with a duplex conformation if a complementary strand exists in the 
system. For example, the DP conformations described herein may be in equilibrium with a chair 
conformation (i.e., the latter conformation having bridging loops running orthogonal to two 
parallel loops and resuhing from the simple folding-over of a DNA G-hairpin) or a basket 
conformation. The equilibrium may be shifted to favor one conformation over another such that 
the favored conformation is present in a higher concentration or fraction over the other 
conformation or other conformations. A certain conformation also may be trapped, for example, 
by selectively binding one conformation over others by a compound that stabilizes the particular 
conformation. The terms "favor*' and *trap" as used herein refer to one conformation being at a 
higher concentration or firaction relative to other conformations, and also refer to stabilizing the 
particular quadruplex conformation. The terms "hinder" or "non-trapped" as used herein refer to 
one conformation being at a lower concentration with respect to other conformations. One 
conformation may be favored or trapped over another conformation if it is present in the system 
at a fraction of 50% or greater, 75% or greater, or 80% or greater or 90% or greater with respect 
to another conformation (e.g., another quadruplex conformation, another paranemic 



6 



PATENT 

Docket 532233001400 



conformation, or a duplex conformation). Conversely, one conformation may be hindered or not 
trapped if it is present in the system at a fraction of 50% or less, 25% or less, or 20% or less and 
10% or less, with respect to another conformation. 

[0018J Equilibrium can be shifted to favor one quadruplex form over another by employing 
a variety of methods. For example, certain bases in a quadruplex nucleic acid may be mutated to 
prevent the formation of one conformation. Typically, these mutations are located in tetrad 
regions of the quadruplex (i.e., regions in which four bases interact with one another in a planar 
orientation). Also, ion concentrations and the time with which a quadruplex nucleic acid is 
contacted with certain ions can favor one conformation over another. For example, potassium 
ions stabilize quadruplex structures, and higher concentrations of potassium ions and longer 
contact times of potassium ions with a quadruplex nucleic acid can favor one conformation over 
another. A particular quadruplex conformation, such as IP and chair conformations, can be 
favored with contact times of 5 minutes or less in solutions containing 100 mM potassium ions, 
and often 10 minutes or less, 20 minutes or less, 30 minutes or less, and 40 minutes or less. 
Basket conformations typically require longer contact times with potassium ions. Potassium ion 
concentration and the counter anion can vary, and the specific quadruplex conformations 
existing for a given set of conditions can be determined. Furthermore, different quadruplex 
structures may be distinguished, trapped and favored by probing them with molecules that 
favorably interact with one quadruplex form over another (e.^., TMPyP4 binds with a higher 
affinity to chair structures as opposed to basket and IP structures). Quadruplex-interacting 
compounds sometimes bind with higher afHnity to particular quadruplex structures in vitro than 
in vivo (e.g., the compound teloraestatin binds to IP and chair structures in vitro but more 
favorably to chair structures in vivo), 

[0019] Particular nucleotide sequences in a nucleic acid sometimes direct the type of 
secondary structure or structures that the nucleic acid is capable of adopting. For example, 
nucleic acid sequences conforming to the motif (GaXb)cGa sometimes form an intramolecular IP 
or antiparallel G-quadruplex structure. Sometimes a is an integer between 2 and 6 and b is an 
integer between 1 and 4, and often, b is the integer 2 or 3. A nucleic acid often includes one or 
more flanking nucleotides on the 5' and/or 3* end of the nucleotide sequence that forms the 
quadruplex and are not part of the quadruplex structure. These motifs can be used to identify 
other quadruplex-forming sequences in regions of a genome operably linked to a gene. G- 
quadruplexes formed by sequences conforming to this motif sometimes include 2 to 6 G-tetrads, 
and often include between 3 and 5 G-tetrads. 
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(0020) Sometimes, a nucleic acid capable of forming one or more secondary structures 
includes a nucleotide sequence identical to a native nucleotide sequence present in genomic 
DNA. For example, a nucleic acid often comprises or sometimes consists of a nucleotide 
sequence or a portion of a nucleotide sequence set forth in Table 1. The nucleotide sequences in 
Table 1 originate from regions in genomic DNA that are capable of forming a quadruplex 
structure, which can regulate transcription of the CMYQ PDGFA, PDGFB/c-sis, c-ABL, RET, 
BCL-2, Cyclin DI/BCL-1, KRAS, HRASmA VEGF, 



TABLE 1 



Sequence 


ID NO 


Origin 


TG4AGJTG4AG3TG4AAGG 


1 


CMYC 


G13CG5CG5CG5AG4T 


2 


PDGFA 


G8ACGCG3AGCTGsAG3CTrG4CCAG3CG4CGCTTAG5 


3 


PDGFB/c-sis 


AGGAAG4AG3CCG6AGGTGGC 


4 


c-ABL 


G5(CG4)3 


5 


RET 


G3AGGAAG5CG3AGTCG4 


6 


BCL-2 


G4ACGCG3CG5CG6AG3CG 


7 


Cyclin D\/BCL-1 


(G3A)3AGGA(G3A)4GC 


8 


K-RAS 


G5(CG4)3 


9 


H-RAS 


C3G4CG3C2G5CG4TC3G2CG5CG2AG 


10 


VEGF 



While quadruplex forming sequences typically arc identified in regulatory regions upstream of a 
gene (e.g,, a promoter or a 5' untranslated region (UTR)), quadruplex forming sequences also 
may be identified within a 3' UTR or within an intron or exon of a gene. 

[0021] A nucleic acid sometimes includes a nucleotide sequence similar to or substantially 
identical to a native nucleotide sequence. A similar or substantially identical nucleotide 
sequence may include modifications to the native sequence, such as substitutions, deletions, or 
insertions of one or more nucleotides. The substantially identical sequence sometimes conforms 
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to the (GaXb)cGa motif described above. The term "substantially identical" refers to two or more 
nucleic acids sharing one or more identical nucleotide sequences. Included are nucleotide 
sequences that sometimes are 55%, 60%, 65%, 70%, 75%, 80%, or 85% identical to a native 
quadruplex-forming nucleotide sequence, and often are 90% or 95% identical to the native 
quadruplex-forming nucleotide sequence (each identity percentage can include a 1%, 2%, 3% or 
4% variance). One test for determining whether two nucleic acids are substantially identical is 
to determine the percentage of identical nucleotide sequences shared between the nucleic acids. 

[0022] Calculations of sequence identity can be performed as follows. Sequences are 
aligned for optimal comparison purposes and gaps can be introduced in one or both of a first and 
a second nucleic acid sequence for optimal alignment. Also, non-homologous sequences can be 
disregarded for comparison purposes. The length of a reference sequence aligned for 
comparison purposes sometimes is 30% or more, 40% or more, 50% or more, often 60% or 
more, and more often 70%, 80%, 90%, 100% of the length of the reference sequence. The 
nucleotides at corresponding nucleotide positions then are compared among the two sequences. 
When a position in the first sequence is occupied by the same nucleotide as the corresponding 
position in the second sequence, the nucleotides are deemed to be identical at that position. The 
percent identity between the two sequences is a ftinction of the number of identical positions 
shared by the sequences, taking into account the number of gaps, and the length of each gap, 
introduced for optimal alignment of the two sequences. 

[0023] Comparison of sequences and determination of percent identity between two 
sequences can be accomplished using a mathematical algorithm. Percent identity between two 
nucleotide sequences can be determined using the algorithm of Meyers & Miller, CABIOS 
4:1 1-17 (1989), which has been incorporated into the ALIGN program (version 2.0), using a 
PAM120 weight residue table, a gap length penalty of 12 and a gap penalty of 4. Percent 
identity between two nucleotide sequences can be determined using the GAP program in the 
GCG software package (available at http address www.gcg.com), using a NWSgapdna.CMP 
matrix and a gap weight of 40, 50, 60, 70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6. A set 
of parameters often used is a Blossum 62 scoring matrix with a gap open penalty of 12, a gap 
extend penalty of 4, and a frame shift gap penalty of 5. 

[0024] Another manner for determining if two nucleic acids are substantially identical is to 
assess whether a polynucleotide homologous to one nucleic acid will hybridize to the other 
nucleic acid under stringent conditions. As use herein, the term "stringent conditions" refers to 
conditions for hybridization and washing. Stringent conditions are known to those skilled in the 
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art and can be found in Current Protocols in Molecular Biology^ John Wiley & Sons, N.Y,, 
6.3.1-6.3.6 (1989), Aqueous and non-aqueous methods are described in that reference and either 
can be used. An example of stringent conditions is hybridization in 6X sodium chloride/sodium 
citrate (SSC) at about 45^C, followed by one or more washes in 0.2X SSC, 0.1% SDS at 50^*0. 
Another example of stringent conditions are hybridization in 6X sodium chloride/sodium citrate 
(SSC) at about followed by one or more washes in 0.2X SSC, 0. 1% SDS at 55°C. A 
further example of stringent conditions is hybridization in 6X sodium chloride/sodium citrate 
(SSC) at about 45^C, followed by one or more washes in 0.2X SSC, 0.1% SDS at 60X. Often, 
stringent conditions are hybridization in 6X sodium chloride/sodium citrate (SSC) at about 
45X, followed by one or more washes in 0.2X SSC, 0.1% SDS at 65°C. Also, stringency 
conditions include hybridization in 0.5M sodium phosphate, 7% SDS at 65**C, followed by one 
or more washes at 0.2X SSC, 1% SDS at 65X. 

|0025] Also, nucleotide sequences of native quadruplex-forming nucleotide sequences may 
be used as ''query sequences" to perform a search against public databases to identify related 
sequences. Such searches can be performed using the NBLAST and XBLAST programs 
(version 2.0) of Altschul, etal, J. Mol Biol 215:403-410 (1990). BLAST nucleotide searches 
can be performed with the NBLAST program, score = 100, wordlength = 12 to obtain 
homologous nucleotide sequences. To obtain gapped alignments for comparison purposes. 
Gapped BLAST can be utilized as described in Altschul, et al. Nucleic Acids Res. 25(17):3389- 
3402 (1997). When utilizing BLAST and Gapped BLAST programs, default parameters of the 
respective programs (e.g., XBLAST and NBLAST) can be used (see, http address 
www.ncbi.nlm.nih.gov). 

Contacting Nucleic Acids with Test Molecules 
[0026] The nucleic acid is contacted with a test molecule, the latter of which is described in 
further detail hereafter, in a variety of manners. In certain embodiments, a physical 
representation of the nucleic acid and the test molecule is contacted and interactions between the 
physical representations are observed. In other embodiments, a representation of the nucleic 
acid is contacted with a representation of a test molecule on paper or in silico {i.e., in a computer 
environment). 

[0027] In in silico processes, three-dimensional representations of the nucleic acid and the 
test molecule often are brought into proximity with one another. The three dimensional 
representations of the nucleic acid and the test molecule are oriented with one another to 
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maximize favorable interactions (e.g., ionic interactions, hydrogen bonds and/or hydrophobic 
interactions) and minimize unfavorable interactions (e.g., steric clashes) between portions of the 
test molecule and portions of the nucleic acid, which minimizes the energy of the complex. The 
contacting process sometimes is performed manually (e.g., the orientation of three-dimensional 
representations of the nucleic acid and the test molecule are manipulated on a computer by a 
user), sometimes is accomplished in an automated manner using computer software (e.g., the 
software docks the three-dimensional representations of the nucleic acid and the test molecule 
and then energy minimizes the resulting structures), and often is accomplished by a combination 
of a manual and automated method. Commercially available computer software is available for 
carrying out these processes (see, e.g., AutoDock, DOCK, Dock/Insightll, FlexX, 
MULTIDCX^K, FTDOCK and GRAMM computer programs; see also http address 
www.molvisxhem.indiana.edu/ app _guide/ for other computer programs useful for visualizing 
three dimensional representations of molecules and manipulating them). Using such software, 
the test molecule sometimes is part of a molecular library from which a computer program often 
samples test molecules iteratively and docks them with a nucleic acid. In other embodiments, a 
computer program is utilized to build test molecules having portions that meet specific criteria. 
In an example of the latter embodiments, a test molecule can be built by specifying that a 
hydrophobic moiety must be in proximity to a duplex region located between a G-quadruplex 
structure and an i-motif and also specify that a moiety capable of hydrogen bonding with 
nucleotides in a G-quadruplex structure be present and specifically oriented in the test molecule. 

[0028] The nucleic acid sometimes is contacted with a nucleic acid binding protein in silico 
by docking three-dimensional renditions of the two molecules to one another. 
Three-dimensional renditions of test molecules sometimes are contacted with the three- 
dimensional rendition of the docked nucleic acid/protein structure and sometimes the test 
molecule or candidate molecule is contacted with the undocked nucleic acid and/or binding 
protein before the nucleic acid and protein are docked to one another. In an embodiment, the- 
nucleic acid comprises or consists of a structure set forth in Figure 1 . In another embodiment, 
the nucleic acid is contacted with one or more nucleic acid binding protein, such as NM23-H2 
{see e.g., deposit reference INUE at http address rutgers.rcsb.org/pdb/ for coordinates of a three- 
dimensional representation of an NM23-H2 protein) and/or another nucleic acid binding protein 
(e.g., CNBP, hnRNPK or Spl). In certain embodiments, the nucleic acid is docked with a 
portion of a nucleic acid binding protein before, during, or after the nucleic acid and/or the 
nucleic acid binding protein portion are contacted with a test molecule or candidate molecule. 
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[0029] Based in part upon the proximity of specific atoms in the test molecule and in the 
nucleic acid (e.g., distance between the atoms), it can be determined whether one or more atoms 
in the test molecule and in the nucleic acid interact with one another. Atoms in the two 
molecules can interact with one another to form hydrogen bond interactions, Van der Waals 
interactions, ionic interactions, and hydrophobic interactions, for example. A portion of a test 
molecule often is in proximity to a site in the nucleic acid molecule when the closest atoms in 
the two molecules arc within a distance of about 8 A from one another, about 5 A from one 
another, about 4 A from one another, about 3 A from one another, about 2 A from one another, 
about 1 A from one another, or about 0.5 A from one another. The molecules sometimes are 
determined as being in proximity to one another based upon a relative distance, where the 
distance between an atom in the test molecule and an atom at a site of the nucleic acid is closer 
to one another than the distance between other test molecule atoms and nucleotide atoms. 

[0030] Candidate molecules are selected from test molecules meeting certain criteria. In an 
embodiment, one criterion is that a portion of the test molecule is in proximity to a site and 
optionally interacts with a site comprising one or more atoms of a nucleoside in a G-tetrad of the 
G-quadniplex IP structure. For example, a portion of the test molecule may be in proximity and 
optionally interact widi a portion of a guanosine in a tetrad, such as a tetrad formed by 
guanosines at positions 9, 13, 14, 18, 22, 23 or positions 7, 1 1, 12, 16, 20, 21 or a combination 
thereof in the IP quadruplex structures of Figure 1 . A portion of the test molecule also may be 
in proximity to and optionally interact with one or more atoms In a nucleoside located in a loop 
region (e.g., an atom in a thymidine at position 10, a guanosine at position 1 1, a guanosine at 
position 14, an adenosine at position 15, a thymidine at position 19, a guanosine at position 20, 
or a combination thereof in the loop regions of the G-quadruplex IP structures in Figure 1), and 
sometimes, may be in proximity to or optionally interact with one or more atoms in a nucleoside 
not in a loop or tetrad (e.g., an adenosine at position 6 or a guanosine at position 23 in Figure 1). 

[0031] Of the intramolecular parallel conformations having three parallel strands and one 
antiparallel strand, a portion of the test molecule may be in proximity to and optionally interact 
with one or more atoms of a tetrad guanosine, such as a guanosine in a tetrad at position 7, 13, 
18 or 22; 9, 13, 18 or 20; 9, 1 1, 16 or 20; or 7 ,1 1, 16 or 22 as shovm in Figure 2 or the like. For 
the latter G-quadruplex conformations, a portion of the test molecule may be in proximity to and 
optionally interact with one or more atoms of a loop nucleoside, such as a thymidine at position 
10, a guanosine at position 1 1, an adenosine at position 15, a thymidine at position 19, and a 
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guanosine at position 20, and sometimes, may be in proximity to or optionally interact with one 
or more atoms in a nucleoside not in a loop or tetrad, such as an adenosine at position 6. 

(0032J Of intramolecular G-quadruplex conformations having mixed strand polarity, such as 
the conformation shown on the right of Figure 2 and the like, a portion of the test molecular may 
in proximity to and optionally interact with one or more atoms of a tetrad guanosine, such as a 
guanosine in a tetrad at position 7. 14, 18 or 21 or position 9, 12, 16 or 23, A portion of the test 
molecule also may be in proximity to and optionally interact with one or more atoms in a 
nucleoside located in a loop region (e.g., an atom in a thymidine at position 10, a guanosine at 
position 1 1, an adenosine at position IS, a thymidine at position 19, a guanosine at position 20, 
or a combination thereof), and sometimes, may be in proximity to or optionally interact with one 
or more atoms in a nucleoside not in a loop or tetrad (e.g., an adenosine at position 6). 

[0033] The methods described herein also are applicable to intramolecular antiparallel 
conformations, such as those shown in Figure 3 and the like. A portion of a test molecule may 
be in proximity to and optionally interact with one or more atoms in a tetrad guanosine, such as 
a guanosine in a tetrad at position 7, 13, 16 or 22; 7, 13, 16 or 23; 7, 14, 16 or 22; 7, 14, 16 or 
23; 9, 11, 18or20;9, 11, 18or21;9, 12, 18or20;or9, 12, 18or21. A portion of the test 
molecule may be in proximity to and optionally interact with one or more atoms of a loop 
nucleoside in an intramolecular antiparallel structure, such as a thymidine at position 10, a 
guanosine at position 1 1, a guanosine at position 14, an adenosine at position IS, a thymidine at 
position 19 and/or a guanosine at position 20, and sometimes, may be in proximity to or 
optionally interact with one or more atoms in a nucleoside not in a loop or tetrad, such as an 
adenosine at position 6 or a guanosine at position 23. 

[0034] Nucleic acid subsequence having sufficient guanosines to form G-quadruplex tetrads 
can be aligned to form the conformations depicted in Figure 1 and Figure 2, even where the 
nucleotide subsequence differs from the CMYC NHE region subsequence and the nucleosides 
and number of nucleosides not forming the tetrad core differ. For example, nucleotide 
subsequences having sufficient guanosines to form G-tetrads in a quadruplex, such as those 
shown in Table 1, can be assembled into the nucleic acid configurations shown in Figures 1 and 
2. 

[0035] Test molecules that interact with more than one atom in a nucleoside participating in 
a G-tetrad or a loop often are identified as candidate molecules. In addition to interacting with 
atoms in a G-quadruplex, a candidate molecule sometimes interacts with one or more atoms in 
an adjacent secondary structure. An adjacent secondary structure sometimes is formed by an 
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i-motif complementary to the nucleotide sequence that forms the G-quadruplex. In other 
embodiments, the adjacent secondary structure is formed from a nucleotide sequence located 
upstream or downstream of the same strand having the G-quadruplex-forming nucleotide 
sequence that interacts with the G-quadruplex (e.g., a silencer sequence in association with a G- 
quadruplex formed by a nucleotide sequence located 5' of the PDGF receptor open reading 
frame; see e.g., U.S. application no. 60/425,869 filed November 12, 2002), The candidate 
molecule sometimes forms one or more hydrogen bonds and/or sometimes forms a hydrophobic 
interaction (e.g., end-staclcs or intercalates) with one or more atoms in a G-quadruplex structure 
and/or an adjacent secondary structure. A portion of a candidate molecule sometimes forms a 
hydrogen bond and/or intercalates with one or more atoms in a duplex region flanking the G- 
quadruplex and/or the adjacent secondary structure. 

[0036] In addition to interactions with one IP quadruplex conformation, a candidate 
molecule sometimes interacts with another G-quadruplex conformation, such as another IP 
conformation or an intramolecular antiparaliel conformation. In certain embodiments, a 
candidate molecule interacts with a nucleic acid when the latter is in an IP quadruplex 
conformation having four parallel strands contributing to the tetrad core and in an IP quadruplex 
conformation having two or three parallel strands contributing to the tetrad core. In another 
embodiment, the candidate molecule interacts with an IP quadruplex conformation having four 
parallel strands with a lower affinity than an IP quadruplex conformation having two or three 
parallel strands. In another embodiment, a candidate molecule identified as interacting with an 
IP quadruplex is tested to determine whether it can interact with a non-IP quadruplex form or 
structure (e.g., antiparaliel, basket, intermolecular). 

[0037J Test molecules can be optimized to form specific interactions with one or more sites 
of the nucleic acid. For example, a test molecule can be designed or optimized to include a 
chemical moiety capable of forming an interaction with an epsilon-amino moiety from a lysine 
in the nucleic acid binding protein (e.g., lysine 12 in a subunit of NM23-H2), thereby inhibiting 
relaxation of secondary structures in the nucleic acid, inhibiting transcription of the CMYC gene, 
and inhibiting or reducing cell proliferation. In the latter embodiment, such an inhibitor 
sometimes acts as a suicide inhibitor which traps the binding protein in an unproductive 
conformation. The test molecule also can be designed or optimized to include portions that 
interact with the G-quadruplex structure and/or an i-motif structure and optionally to include a 
moiety that intercalates with the intermediate duplex region or part of the duplex region. 
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Intercalation moieties often are hydrophobic and examples of such moieties are known (e.g., 
ethidium bromide and odier moieties described below). 

Test Molecules and Candidate Molecules 

[0038] The nucleic acid is contacted with one or more test molecules to identify candidate 
molecules that modulate a biological activity of the nucleic acid. Molecules often are organic or 
inorganic compounds having a molecular weight of 10,000 grams per mole or less, and 
sometimes having a molecular weight of 5,000 grams per mole or less, 1,000 grams per mole or 
less, or SOO grams per mole or less. Also included are salts, esters, and other pharmaceutically 
acceptable forms of the compounds. Compounds that interact with nucleic acids are known in 
the art {see, e.g.. Hurley, Nattire Rev. Cancer 2:188-200 (2002); Anantha, el al.. Biochemistry 
Vol. 37, No. 9:2709-2714 (1998); and Ren, et al. Biochemistry 38:16067-16075 (1999)). 

[0039] Compounds can be obtained using known combinatorial library methods, including 
spatially addressable parallel solid phase or solution phase libraries; synthetic library methods 
requiring deconvolution; "one-bead one-compound" library methods; and synthetic library 
methods using affinity chromatography selection. Examples of methods for synthesizing 
molecular libraries are described, for example, in DeWitt, et aL^ Proc. Natl Acad. Set. U.S.A. 
90:6909 (1993); Erb, et al., Proc. Natl. Acad. Sci. USA 91 :1 1422 (1994); Zuckenmann, et al, 
J. Med. Chem. 37:2678 (1994); Cho, et al. Science 261:1303 (1993); Carrell, etal. Angew. 
Chem. Int. Ed. Engl 33:2059 (1994); Carell, et al, Angew. Chem. Int Ed. Engl 33:2061 (1994); 
and Gallop, etal, J. Med Chem, 37:1233 (1994). 

[0040] In addition to an organic and inorganic compound, a molecule sometimes is a nucleic 
acid, a catalytic nucleic acid (e.^., a ribozyme), an inhibitory RNA (RNAi or siRNA), a 
nucleotide, a nucleotide analog, a polypeptide, an antibody, or a peptide mimetic. Methods for 
making and using these molecules are known. For example, methods for making ribozymes and 
assessing ribozyme activity are described (see e.g., U.S. Patent Nos. 5,093,246; 4,987,071; and 
5,1 16,742; Haselhoff & Geriach, Nature 334:585-5ri (1988) and Bartel & Szostak, Science 
261 :141 1-1418 (1993)). Also, methods for generating siRNA are known (see e.g., Elbashir, et 
al. Methods 26:199-213 (2002) and http address www.dharmacon.com) and peptide mimetic 
libraries are described (see, e.g., Zuckermann, et al, J. Med. Chem. 37:2678-2685 (1994)). 

[0041] Candidate molecules sometimes meet the following structural and chemical criteria. 
A candidate molecule sometimes interacts with one region of the nucleic acid or the nucleic acid 
binding protein, and sometimes interacts with two or more regions of the nucleic acid and/or the 
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nucleic acid binding protein (e.g., the candidate molecule can interact with two or more regions 
in the nucleic acid or nucleic acid binding protein, or can interact with a region in the nucleic 
acid and a region in the nucleic acid binding protein). Each molecule sometimes includes a 
moiety that interacts with a G-quadruplex structure, an i-motif structure and/or a duplex 
structure of the nucleic acid, and/or an amino acid of the nucleic acid binding protein. In an 
embodiment, a moiety in the molecule is capable of end-stacking with or intercalating between 
one or more G-tetrads of a G-quadruplex, such as a moiety comprising a planar or polycyclic 
structure, for example. Examples of such moieties are anthraquinone, acrtdone, napthyl, 
pheoxazine, xanthone, benzoxazole, phenathiazine, phenazine, benzothiazole, acridine, 
dibenzofuran, benzimidazole, fluorenone, fluorene, and phenanthroline. In another embodiment, 
a moiety is a duplex DNA intercalator, capable of binding to a duplex DNA region adjacent to a 
secondary structure in the nucleic acid, such as a moiety having a planar or polycyclic structure 
(e.g., an intercalator listed previously). In a related embodiment, the moiety is capable of 
groove-binding to a duplex region in the nucleic acid, such as a polypeptide or sugar-based 
moiety capable of groove binding. In other embodiments, a moiety is capable of binding to an 
amino acid of a nucleic acid binding protein (e.g., NM23 protein), such as a nucleotide or a 
nucleotide mimetic, or a carbonyl-, acetal-, or imine-containing moiety. Molecules having 
quadruplex-interacting moieties are disclosed in application no. 10/407,449 filed April 4, 2003; 
application no. 10/660,897 filed September 1 1, 2003; application no. 10/661,241 filed 
September 12, 2003; application no. 60/463,171 filed April 15, 2003 and application no. 
60/461,205 filed April 7, 2003. 

[0042] Each molecule that interacts with two or more target regions in a nucleic acid and/or 
a nucleic acid binding protein sometimes comprises two or more moieties that independently 
interact with the target regions and are joined by a linker. A linker joining the moieties 
sometimes is 7.5 A to 40 A in length, sometimes comprises between 5 and 20 atoms, often is 
flexible, and sometimes is constrained (e.g., in a conformation that follows the groove of duplex 
DNA). The linker sometimes comprises polyamide or polysaccharide moieties (e.g., comprising 
amino saccharide units), and sometimes includes known linkage fiinctionalities such as those 
independently selected from amide, ester, ether, amine, sulfide, sulfonamide, alkyi or aryl, for 
example. 

[0043] Featured herein is structural information descriptive of the candidate molecules and 
therapeutics identified by the processes described herein as modulating the biological activity of 
a nucleic acid capable of adopting a G-quadruplex IP conformation. As described above, the 
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candidate molecule or therapeutic may modulate the biological activity by interacting with G- 
quadruplexes in other conformations, such as the chair conformation for example. In certain 
embodiments, information descriptive of candidate molecule structure (e.g., chemical formula or 
sequence information) sometimes is stored and/or renditioned as an image or as three- 
dimensional coordinates. The information often is stored and/or renditioned in computer 
readable form and sometimes is stored and organized in a database. In certain embodiments, the 
information may be transferred from one location to another using a physical medium (e.g., 
paper) or a computer readable medium (e.g., optical and/or magnetic storage or transmission 
medium, floppy disk, hard disk, random access memory, computer processing unit, facsimile 
signal, satellite signal, transmission over an internet or transmission over the world-wide web). 

Biological Activity of Candidate Molecules 

[0044] Determining whether the biological activity of a native quadruplex DNA is 
modulated in a cell, tissue, or organism can be accomplished by monitoring modulation of a 
signal in an in vitro or in vivo assay. The signal in the assay sometimes is generated or 
modulated by transcription of the nucleic acid, binding of a protein to the nucleic acid (e.g., 
NM23-H2), a fluorophore incorporated in the nucleic acid, or cell proliferation, for example. 
Transcription can be detected, for example, by directly detecting RNA transcripts or detecting 
polypeptides translated by transcripts, which are methods known in the art. 

[0045] Candidate molecules, some identified by the procedures described herein, sometimes 
are screened in in vitro or in vivo assays to determine whether they modulate the biological 
activity of the nucleic acid. Candidate molecules and nucleic acids can be added to an assay 
system in any order to determine whether the candidate molecule modulates the biological 
activity of the nucleic acid. For example, a candidate molecule sometimes is added to an assay 
system before, simultaneously, or after a nucleic acid is added. 

[0046] In these assays, candidate molecules are contacted with the nucleic acid in the assay 
system, where the term "contacting" refers to placing a candidate molecule in close proximity to 
a nucleic acid and allowing the assay components to collide with one another, often by diffusion. 
Contacting these assay components with one another can be accomplished by adding them to a 
body of fluid or in a reaction vessel, for example. The components in the system may be mixed 
in variety of manners, such as by oscillating a vessel, subjecting a vessel to a vortex generating 
apparatus, repeated mixing with a pipette or pipettes, or by passing fluid containing one assay 
component over a surface having another assay component immobilized thereon, for example. 
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[0047] As used herein, the term "system" refers to an environment that receives the assay 
components, which includes, for example, microtiter plates {e.g., 96-well or 384-well plates), 
silicon chips having molecules immobilized thereon and optionally oriented in an array {see, 
e.g., U.S. Patent No. 6,261,776 and Fodor, Nature 364:555-556 (1993)), and microfluidic 
devices {see, e.g., U.S. Patent Nos. 6,440,722; 6,429,025; 6,379,974; and 6,316,781). The 
system can Include attendant-equipment for carrying out the assays, such as signal detectors, 
robotic platforms, and pipette dispensers. 

[0048] One or more assay components (e.g., the nucleic acid, candidate molecule or nucleic 
acid binding protein) sometimes are immobilized to a solid support. The attachment between an 
assay component and the solid support often is covalent and sometimes is non-covalent {see, 
e.g., U.S. Patent No. 6,022,688 for non-covalent attachments) and the attachment sometimes is 
by a linking moiety. The solid support often is one or more surfaces of the system, such as one 
or more surfaces in each well of a microtiter plate, a surface of a silicon wafer, a surface of a 
bead (^ee, e.g.. Lam, Nature 354: 82-84 (1991)) optionally linked to another solid support, or a 
channel in a microfluidic device, for example. Types of solid supports, linker molecules for 
covalent and non-covalent attachments to solid supports, and methods for immobilizing nucleic 
acids and other molecules to solid supports are known (^ee, e.g., U.S. Patent Nos. 6,261,776; 
5,900,481; 6,133,436; and 6,022,688; and WIPO publication WO 01/18234). 

[0049] Protein molecules sometime are contacted with the nucleic acid. Polypeptide 
molecules sometimes are added to the system in free form, and sometimes are linked to a solid 
support or another molecule. For example, polypeptide test molecules sometimes are linked to a 
phage via a phage coat protein. The latter embodiment often is accomplished by using a phage 
display system, where nucleic acids linked to a solid support arc contacted with phages that 
display different polypeptide candidate molecules. Phages displaying polypeptide candidate 
molecules that interact with the immobilized nucleic acids adhere to the solid support, and phage 
nucleic acids corresponding to the adhered phages then are isolated and sequenced to determine 
the sequence of the polypeptide test molecules that interacted with the immobilized nucleic 
acids. Methods for displaying a wide variety of peptides or proteins as fiisions with 
bacteriophage coat proteins are known (Scott and Smith, Science 249:386-390 (1990); Devlin, 
Science 249:404-406 (1990); Cwirla, etaL Proc. Natl. Acad Sci. 87:6378-6382 (1990); Felici, 
J. MoL Biol 222:301-310 (1991); U.S. Patent Nos. 5,096,815 and 5,198,346; U.S. Patent 
Nos. 5,223,409; 5,403,484; 5,571,698; and 5,766,905). Methods also are available for linking 
the test polypeptide to the N-terminus or the C-temiinus of the phage coat protein. 
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[0050] A signal generated by the system when a candidate molecule binds to a nucleic acid 
and/or a nucleic acid binding protein often scales directly with a range of increasing nucleic 
acid, nucleic acid binding protein, or candidate molecule concentrations. Signal intensity often 
exhibits a hyperbolic relationship when plotted as a function of nucleic acid, candidate molecule, 
or nucleic acid binding protein concentrations. The signal sometimes is increased relative to 
background signal levels when a candidate molecule binds to a nucleic acid and/or a nucleic acid 
binding protein, and sometimes the signal decreases relative to background signal levels under 
such circumstances. The candidate molecules often interact with the nucleic acid and/or nucleic 
acid binding protein by reversible binding, and sometimes interact with irreversible binding. For 
example, the candidate molecule may reversibly form a covalent bond between a portion of the 
candidate molecule and an amino acid side chain in the protein (e.g., a lysine), depending on the 
chemical structure of the candidate molecule. 

(0051) Candidate molecules often are identified as interacting with the nucleic acid and/or 
nucleic acid binding protein when the signal produced in a system containing the candidate 
molecule is different than the signal produced in a system not containing the candidate molecule. 
While background signals may be assessed each time a new candidate molecule, nucleic acid, or 
nucleic acid binding protein is probed by the assay, detecting the background signal often is not 
required each time a new test molecule or test nucleic acid is assayed. Control assays also can 
be performed to determine background signals and to rule out false positive results and false 
negative results. Such control assays often do not comprise one or more assay components 
included in other assays (e.g., a control assay sample sometimes does not include a candidate 
molecule, a nucleic acid, or a protein that interacts with the nucleic acid). 

[0052] In addition to determining whether a candidate molecule gives rise to a different 
signal, the affinity of the interaction between the candidate molecule with the nucleic acid and/or 
nucleic acid binding protein sometimes is quantified. ICso» IQ, or Ki threshold values sometimes 
are compared to the measured IC50 or K4 values for each interaction, and thereby are used to 
identify a candidate molecule that interacts with the nucleic acid or nucleic acid binding protein 
and modulates the biological activity. For example, IC50 or Ka threshold values of 10 or 
less, 1 or less, and 100 nM or less often are utilized, and sometimes threshold values of 
10 nM or less, 1 nM or less, 100 pM or less, and 10 pM or less are utilized to identify candidate 
molecules that interact with nucleic acids and/or binding proteins and modulate the biological 
activity. 
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[0053] Specific assays sometimes are utilized to identify candidate molecules that modulate 
the biological activity of a nucleic acid and/or binding protein complex. For example, 
fluorescence assays, gel mobility shift assays {see, e.g., Jin & Pike, Mol. Endocrinol. 10:196-205 
( 1 996) and Postel, J. Biol. Chem. ll^aiH 1 -22829 ( 1 999)), polymerase arrest assays, 
transcription reporter assays, DNA cleavage assays, protein binding and apoptosis assays (see, 
e.g., Amersham Biosciences (Piscataway, New Jersey)) sometimes are utilized. Also, 
topoisomerase assays sometimes are utilized subsequently to determine whether the quadruplex 
interacting molecules have a topoisomerase pathway activity {see, e.g., TopoGEN, Inc. 
(Columbus, Ohio)). 

[0054] A fluorescence interaction assay is useful for identifying candidate molecules that 
interact with DNA capable of forming a quadruplex structure. In particular, such assays are 
useful in in vitro high-throughput assays and in gel electrophoretic mobility shift assays. Such 
methods sometimes comprise contacting a sample comprising a nucleic acid with a candidate 
molecule, where the nucleic acid includes or consists of nucleotide sequence that is identical or 
substantially similar to a native nucleotide sequence capable of forming a G-quadruplex 
structure. One or more nucleoside moieties in the native nucleotide sequence sometimes are 
substituted with a fluorescent nucleoside analog. Examples of such fluorescent nucleoside 
analogs are 2-amino purine (e.g., 2-amino adenosine), pyrrolo-C, 6-MAP, and furano-dT (for 
other examples, see http address www.gIenresearch.com/GlenReports/GR15-13.html). In 
specific embodiments, one or more nucleoside moieties in a loop region of an IP structure in 
Figure 1 are substituted with a fluorescent nucleoside analog such as 2-amino purine or pyrrolo- 
C. A fluorescent signal generated by the sample is detected after the sample is contacted by the 
candidate molecule, and the test molecule is identified as a candidate molecule that interacts 
with the nucleic acid when the fluorescent signal detected before the sample is contacted with 
the test molecule differs from the fluorescent signal detected after the sample is contacted with 
the test molecule. Fluctuations sometimes are reduced fluorescence intensity at a particular 
wavelength, and sometimes are shifts in the wavelengths at which fluorescence is detected. 
Often, the labeled strand is hybridized with a complementary strand and any fluctuations in 
fluorescence are detected upon hybridization, and the labeled hybrid then is contacted with test 
molecules and fluctuations in fluorescence are detected to determine which of the test molecules 
interact with die labeled nucleic acid. In certain embodiments, the sample is contacted with a 
nucleic acid binding protein such as NM23-H2, Spl, CNBP and/or hnRNPK before, at the same 
time, or after the sample is contacted with the test molecule. In other embodiments, the labeled 
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nucleic acid is interacted with test molecules or proteins and the reaction products then are 
subjected to a gel electrophoretic mobility shift assay. 

[0055] Another example of a fluorescence interaction assay is a system that includes a 
nucleic acid, a signal molecule, and a candidate or test molecule. The signal molecule generates 
a fluorescent signal when bound to the nucleic acid (e.g., N-methylmesoporphyrin IX (NMM)), 
and the signal is altered when a candidate compound competes with the signal molecule for 
binding to the nucleic acid. An alteration in the signal when a candidate molecule is present as 
compared to when the candidate molecule is not present identifies the candidate molecule as a 
nucleic acid-interacting molecule. 50 ^1 of nucleic acid is added in 96-well plate. A candidate 
molecule also is added in varying concentrations. A typical assay is carried out in 100 ^1 of 20 
mM HEPES buffer, pH 7.0, 140 mM NaCl, and 100 mM KCI. 50 nl of the signal molecule 
NMM then is added for a final concentration of 3 \iM, NMM is obtained firom Frontier 
Scientific Inc, Logan, Utah. Fluorescence is measured at an excitation wavelength of 420 nm 
and an emission wavelength of 660 nm using a FluroStar 2000 fluorometer (BMG 
Labtechnologies, Durham, NC). Fluorescence often is plotted as a function of concentration of 
the candidate molecule or nucleic acid and maximum fluorescent signals for NMM are assessed 
in the absence of these molecules. 

[0056] DNA cleavage assays are useful for determining at which sites of a nucleic acid a 
nucleic acid binding protein interacts, for example. DNA cleavage assays have been reported 
(e.g., Postel,y. BioL Chem., 274:22821-22829 (1999)), In general, a detectable label is 
incorporated at one portion of the nucleic acid and the label is separated from another portion of 
the nucleic acid having no detectable label or a different detectable label upon cleavage. 
Examples of detectable labels are known, such as fluorophores (e.g., Anantha, et al. 
Biochemistry 37:2709-2714 (1998) and Qu & Chaires, Methods Enzymol 321 :353-369 (2000)), 
fluorescent nucleotide analogs described above, NMR spectral shifts {see, e.g., Arthanari & 
Bolton, Anti-Cancer Drug Design 14:3 1 7-326 (1999)), fluorescence resonance energy transfers 
[see, e.g., Simonsson & SjGback, J. Biol Chem. 274:17379-17383 (1999)), a radioactive isotope 
(e.g., '^^I, ^% ^^P, or ^H); a light scattering label {see, e.g., U.S. Patent No. 6,214,560; 
Genicon Sciences Corporation, San Diego, CA); an enzymic or protein label (e.g., green 
fluorescent protein (GFP) or peroxidase), or another chromogenic label or dye. The nucleic acid 
also can be linked to two fluorophores for a fluorescence resonance energy transfer (FRET) 
assay, where one fluorophore emits light at a wavelength at which the other fluorophore is 
excited, where such fluorescence energy transfer occurs when the nucleic acid is intact and does 
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not occur when the nucleic acid is cleaved by a nucleic acid binding protein. Similarly, a 
candidate molecule linked to a nucleic'acid binding protein can be detected by detecting the 
candidate molecule bound to the protein or a detectable label bound to a candidate molecule 
linked to a binding protein. 

[0057] A gel electrophoretic mobility shift assay (EMSA) is useful for determining whether 
a nucleic acid forms a quadruplex and whether a nucleotide sequence is quadruplex- 
destabilizing. EMSA is conducted as described previously (Jin & Pike, MoL Endocrinol 
10:196-205 (1996)) with minor modifications. Synthetic single-stranded oligonucleotides are 
labeled in the 5' terminus with T4-kinase in the presence of [y-^^P] ATP ( 1 ,000 mCi/mmol, 
Amersham Life Science) and purified through a sephadex column, ^^P-labeled oligonucleotides 
(--30,000 cpm) then are incubated with or without various concentrations of a testing compound 
in 20 \A of a buffer containing 10 mM Tris pH 7.5, 100 mM KCi, 5 mM dithiothreitol, 0.1 mM 
EDTA, 5 mM MgCb, 10% glycerol, 0.05%Nonedit P-40, and 0.1 mg/ml of poly(dI-dC) 
(Pharmacia). After incubation for 20 minutes at room temperature, binding reactions are loaded 
on a 5% polyacrylamide gel in 0.25 x Tris borate-EDTA buffer (0.25 x TBE, 1 x TBE is 89 mM 
Tris-borate, pH 8.0, 1 mM EDTA). The gel is dried and each band is quantified using a 
phosphorimager. 

[0Q58] Another example of an EMS A assay is performed as follows. Ten microliter 
reactions are assembled in Reaction Buffer (50 mM Tris-HCI, pH 7.9, 0.5 mM dithiothreitol, 
and 50 mg/ml bovine serum albumin). MgCh, KCI, EDTA, protease K, and ATP are added. 
Radiolabeled DNA or fluorescently labeled DNA (described above) and NM23-H2 in storage 
buffer (20 mM Hepes, pH 7.9, 5 mM MgCk, 0.1 mM EDTA, 0.1 M KCI, 1 mM dithiothreitol, 
20% glycerol, and protease inhibitors (Postel, etal, Mol Cell Biol 9:5123-5133 (1989)) are 
added last, and the reactions are incubated for 15 minutes at room temperature. To separate the 
protein-DNA complexes, the reactions are loaded onto 5% native polyacrylamide gels and 
electrophoresed in 0.53 TBE buffer (45 mM Tris borate, pH 8.3, 1.25 mM EDTA) at room 
temperature for 30 minutes at 1 00 V. Gels are vacuum-dried and exposed onto XAR (Eastman 
Kodak Co.) film. 

[0059] Chemical footprinting assays are useftil for assessing quadruplex structure. 
Quadruplex structure is assessed by determining which nucleotides in a nucleic acid is protected 
or unprotected from chemical modification as a result of being inaccessible or accessible, 
respectively, to the modifying reagent. A DMS methylation assay is an example of a chemical 
footprinting assay. In such an assay, bands from EMSA are isolated and subjected to DMS- 
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induced strand cleavage. Each band of interest is excised from an electrophoretic mobility shift 
gel and soaked in 100 mM KCI solution (300 ^il) for 6 hours at 4X. The solutions are filtered 
(microcentrifuge) and 30,000 cpm (per reaction) of DNA solution is diluted further with 100 
mM KCI in 0.1 X TE to a total volume of 70 fil (per reaction). Following the addition of 1 jil 
salmon sperm DNA (0.1 ^g/pl)> the reaction mixture is incubated with 1 ^1 DMS solution 
(DMSrethanol; 4:1; v:v) for a period of time. Each reaction is quenched with 18 ^1 of stop 
buffer (b -mercaptoathanol:water:NaOAc (3 M); 1:6:7; v:v:v). Following ethanol precipitation 
(twice) and piperidine cleavage, the reactions are separated on a preparative gel (16%) and 
visualized on a phosphorimager. 

[0060] A polymerase arrest assay is useful for determining whether a candidate molecule 
stabilizes a quadruplex structure. Such an assay includes a template nucleic acid, which often 
comprises a quadruplex forming sequence, and a primer nucleic acid that hybridizes to the 
template nucleic acid 5' of the quadruplex-forming sequence. The primer is extended by a 
polymerase (e.g., Taq polymerase), which advances from the primer along the template nucleic 
acid. In this assay, a quadruplex structure can block or arrest the advance of the enzyme, leading 
to shorter transcription fragments. Also, the arrest assay may be conducted at a variety of 
temperatures, including 45*^C and 60*^C, and at a variety of ion concentrations. An example of a 
Taq polymerase stop assay is described in Han, et aL, NucL Acids Res. 27:537-542 (1999), 
which is a modification of that used by Weitzmann, e/a/., J, BioL Chem. 271, 20958-20964 
(1996). Briefly, a reaction mixture of template DNA (50 nM), Tris'HCl (50 mM), MgCb (10 
mM), DTT (0.5 mM), EDTA (0.1 mM), BSA (60 ng), and 5 '-end-labeled quadruplex nucleic 
acid (-18 nM) is heated to 90**C for 5 minutes and allowed to cool to ambient temperature over 
30 minutes. Taq Polymerase (1 jil) is added to the reaction mixture, and the reaction is 
maintained at a constant temperature for 30 minutes. Following the addition of 10 ^l stop buffer 
(formamide (20 ml), 1 M NaOH (200 jil), 0.5 M EDTA (400 (il), and 10 mg bromophenol blue), 
the reactions are separated on a preparative gel (12%) and visualized on a phosphorimager. 
Adenine sequencing (indicated by "A" at the top of the gel) is performed using double-stranded 
DNA Cycle Sequencing System from Life Technologies. The general sequence for the template 
strands is TCCAACTATGTATAC-INSERT- 

TTAGCGACACGCAATTGCTATAGTGAGTCGTATTA. Bands on the gel that exhibit 
slower mobility are indicative of quadruplex formation. 

[0061] In another example of a polymerase arrest assay, a 5'- fluorescent-labeled (FAM) 
primer (P45, 15 nM) is mixed with template DNA (15nM) in a Tris-HCL buffer (15 mM Tris, 
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pH 7.5) containing lOmM MgCb, 0.1 mM EDTA and 0.1 mM mixed deoxynucleotide 
triphosphates (dNTFs). The FAM-P45 primer (5'- 6FAM- 

AGTCTGACTGACTGTACGTAGCTAATACGACTCACTATAGCAATT.3') and the 
template DNA (5'-TCCAACTATGTATACTGGGGA GGGTGGGGAGGGTGGGGAAGGTT 
AGCGACACGCAATTGCTATAGTGAGTCGTATTAGCTACGTACAGTCAGTCAGACT- 
3') are synthesized and HPLC purified by Applied Biosystems. The mixture is denatured at 
95**C for 5 minutes and, after cooling down to room temperature, is incubated at 2TC for 
15 minutes. After cooling down to room temperature, ImM KCi2 and the test compound 
(various concentrations) are added and the mixture incubated for 1 5 minutes at room 
temperature. The primer extension is performed by adding lOmM KCI and Taq DNA 
Polymerase (2.5 U/reaction, Promega) and incubating at lO^'C for 30 minutes. The reaction is 
stopped by adding 1 \il of the reaction mixture to 10 jil Hi-Di Formamide mixed and 0.25 ^I 
LIZ120 size standard. Hi-Di Formamide and LIZ120 size standard are purchased from Applied 
Biosystems. The partially extended quadruplex arrest product is between 61 or 62 bases long 
and the full-length extended product is 99 bases long. The products are separated and analyzed 
using capillary electrophoresis. Capillary electrophoresis is performed using an ABI PRISM 
3100-Avant Genetic Analyzer. 

[0062] Certain arrest assays are performed in cells. In a transcription reporter assay, test 
quadruplex DNA is coupled to a reporter system, such that a formation or stabilization of a 
quadruplex structure can modulate a reporter signal. An example of such a system is a reporter 
expression system in which a polypeptide, such as luciferase or green fluorescent protein (GFP), 
is expressed by a gene operably linked to the potential quadruplex forming nucleic acid and 
expression of the polypeptide can be detected. As used herein, the term "operably linked" refers 
to a nucleotide sequence which is regulated by a sequence comprising the potential quadruplex 
forming nucleic acid. A sequence may be operably linked when it is on the same nucleic acid as 
the quadruplex DNA, or on a different nucleic acid. An exemplary luciferase reporter system is 
described herein. A luciferase promoter assay described in He, ei aL^ Science 281:1 509-1 5 12 
(1998) often is utilized for the study of quadruplex formation. Specifically, a vector utilized for 
the assay is set forth in reference 1 1 of the He, et aL, document. In this assay, HeLa cells are 
transfected using the lipofectamin 2000-based system (Invitrogen) according to the 
manufacturer's protocol, using 0.1 jig of pRL-TK (Renilla luciferase reporter plasmid) and 0.9 
pg of the quadruplex-forming plasmid. Firefly and Renilla luciferase activities are assayed 
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using the Dual Luciferase Reporter Assay System (Promega) in a 96-well plate format according 
to the manufacturer's protocol. 

[0063] Circular dichroism (CD) sometimes is utilized to determine whether a candidate 
molecule interacts with a quadruplex nucleic acid. CD is particularly useful for determining 
whether a candidate molecule interacts with a nucleic acid in vitro. In certain embodiments, a 
candidate molecule is added to a DNA sample (S each) in a buffer containing 10 mM 
potassium phosphate (pH 7.2) and 10 or 250 mM KCI at 37X and then allowed to stand for 5 
min at the same temperature before recording spectra. CD spectra are recorded on a Jasco J- 71 S 
spectropolarimeter equipped with a thermoelectrically controlled single cell holder. CD 
intensity normally is detected between 220 nm and 320 nm and comparative spectra for DNA 
alone, candidate molecule alone, and the DNA with the candidate molecule are generated to 
determine the presence or absence of an interaction {see e.g. Datta et aL, JACS 123:9612-9619 
(2001)). Spectra are arranged to represent the average of eight scans recorded at 100 nm/min. 
In certain embodiments, CD signals are monitored as a function of temperature. For example, 
CD signals indicative of the presence of a quadruplex structure in a sample can be determined 
and the loss or shift of those signals can be followed as a function of increasing temperature to 
determine at which temperature the quadruplex structure melts. The signals oflen are monitored 
in the presence of test molecule and in the absence of test molecule, and stabilizing molecules 
that increase the melting temperature of the quadruplex structure are identified as candidate 
molecules. 

[0064] A cell proliferation assay is useful for assessing the utility of a candidate molecule 
for treating a cell proliferative disorder in a subject. In a cancer cell proliferation assay, cell 
proliferation rates are assessed as a function of different concentrations of test compounds added 
to the cell culture medium. Any cancer cell type can be utilized in the assay. In one 
embodiment, colon cancer cells are cultured in vitro and test compounds are added to the culture 
medium at varying concentrations. A useful colon cancer cell line is colo320, which is a colon 
adenocarcinoma cell line deposited with the National Institutes of Health as accession number 
JCRB022S. Parameters for using such cells are available at the http address 
cellbank.nihs.go.jp/cell/data/jcrb0225.htm. 

Utilization of Candidate Molecules as Therapeutics ^ 
[0065] Because quadruplexes are regulators of biological processes such as oncogene 
transcription, modulators of quadruplex biological activity can be utilized as cancer therapeutics. 
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For example, molecules that stabilize quadruplex structures can exert a therapeutic effect for 
certain cell proliferative disorders and related conditions because quadruplex structures typically 
down-regulate the oncogene expression which can cause cell proliferative disorders. 
Quadruplex-interacting candidate molecules can exert a biological effect according to different 
mechanisms, which include, for example, stabilizing a native quadruplex structure, inhibiting 
conversion of a native quadruplex to duplex DNA, and stabilizing a native quadruplex structure 
having a quadruplex-destabilizing nucleotide substitution. Thus, quadruplex interacting 
candidate molecules described herein may be administered to cells, tissues, or organisms, 
thereby down-regulating oncogene transcription and treating cell proliferative disorders. The 
terms "treating," "treatment'* and "therapeutic effect" as used herein refer to reducing or 
stopping a cell proliferation rate {e.g., slowing or halting tumor growth) or reducing the number 
of proliferating cancer cells (e.g., removing part or all of a tumor) and refers to alleviating, 
completely or in part, a cell proliferation condition. 

[0066] Quadruplex interacting molecules and quadruplex forming nucleic acids can be 
utilized to target a cell proliferative disorder. Cell proliferative disorders include, for example, 
colorectal cancers. Other examples of cancers include hematopoietic neoplastic disorders, 
which are diseases involving hyperplastic/neoplastic cells of hematopoietic origin (e.g., arising 
from myeloid, lymphoid or erythroid lineages, or precursor cells thereof). The diseases can arise 
from pooriy differentiated acute leukemias, e.g., erythroblastic leukemia and acute 
megakaryoblastic leukemia. Additional myeloid disorders include, but are not limited to, acute 
promyeloid leukemia (APML), acute myelogenous leukemia (AML) and chronic myelogenous 
leukemia (CML) (reviewed in Vaickus, Crit. Rev. in Oncol/HemotoL 1 1 :267-297 (1991)); 
lymphoid malignancies include, but are not limited to acute lymphoblastic leukemia (ALL), 
which includes B-lineage ALL and T-lineage ALL, chronic lymphocytic leukemia (CLL), 
prolymphocytic leukemia (PLL), hairy cell leukemia (HLL) and Waldenstrom's 
macroglobulinemia (WM). Additional forms of malignant lymphomas include, but are not 
limited to non-Hodgkin lymphoma and variants thereof, peripheral T cell lymphomas, adult T 
cell leukemia/lymphoma (ATL), cutaneous T-cell lymphoma (CTCL), large granular 
lymphocytic leukemia (LGF), Hodgkin's disease and Reed-Stemberg disease. Cell proliferative 
disorders also include cancers of the colorectum, breast, lung, liver, pancreas, lymph node, 
colon, prostate, brain, head and neck, skin, liver, kidney, and heart. Candidate molecules also 
can be utilized to target cancer related processes and conditions, such as increased angiogenesis. 
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by inhibiting angiogenesis in a subject (e.g., molecules that stabilize a VEGF associated 
quadruplex structure can inhibit angiogenesis). 

[0067] Thus, provided herein are methods for reducing cell proliferation or for treating or 
alleviating cell proliferative disorders, which comprise contacting a system having a nucleic acid 
comprising a native quadruplex with a candidate molecule identified herein. The system 
sometimes is a group of cells or one or more tissues, and often is a subject in need of a treatment 
of a cell proliferative disorder. A subject often is a mammal such as a mouse, rat, monkey, or 
human. One embodiment is a method for treating colorectal cancer by administering a candidate 
molecule that interacts with a CMYC regulatory nucleotide sequence to a subject in need thereof, 
thereby reducing the colorectal cancer cell proliferation. Another embodiment is a method for 
inhibiting angiogenesis and optionally treating a cancer associated with angiogenesis, which 
comprises administering a candidate molecule that interacts with a VEGF regulatory nucleotide 
sequence to a subject in need thereof, thereby reducing angiogenesis and optionally treating a 
cancer associated with angiogenesis. In another embodiment, a candidate molecule that interacts 
with a HER2/neu G-quadruplex is administered to a subject for the treatment of a fat cell 
proliferative disorder such as obesity. 

[0068] Retroviruses offer a wealth of potential targets for G-quadruplex targeted 
therapeutics. G-quadruplex stmctures have been implicated as fiinctional elements in at least 
two critical secondary structures formed by either viral RNA or DNA in HIV, the dimer linker 
structure (DLS) and the central DNA flap (CDF). Additionally, DNA aptamers which are able 
to adopt either inter- or intramolecular quadruplex structures are able to inhibit viral replication, 
by targeting either the envelope glycoprotein (putatively) or HIV-integrase respectively. 
Although not direct evidence, the latter observation indicates an involvement of native 
quadruplex structures in interaction with the integrase enzyme. 

[0069] Dimer linker structures, which are common to all retroviruses, serve to bind two 
copies of the viral genome together by a non-covalent interaction between the two 5* ends of the 
two viral RNA sequences. The genomic dimer is stably associated with the gag protein in the 
mature virus particle. In the case of HIV, the origin of this non-covalent binding, may be traced 
to a 98 base-pair sequence containing several runs of at least two consecutive guanines, the 3'- 
most of which is critical for the formation of RNA dimcrs in vitro. An observed cation 
(potassium) dependence for the formation and stability of the dimer in vitro, in addition to the 
failure of an antisense sequence to effectively dimerize, has revealed the most likely binding 
structure to be an intermolecular G-quadruplex. 
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[0070] Prior to integratioii into the host genome, reverse transcribed viral DNA forms a pre- 
integration complex (PIC) with at least two major viral proteins, integrase and reverse 
transcriptase, which is subsequently transported into the nucleus by an as yet undefined 
mechanism. The Central DNA Flap (CDF) refers to 99-base length single-stranded tail of the + 
strand, occurring near the center of the viral duplex DNA, which is known to a play a role in the 
nuclear import of the PIC. Oligonucleotide mimics of the CDF have been shown to form 
intermolecular G-quadruplex structures in cell-free systems. 

[0071] Thus, candidate molecules can be used to stabilize the DLS and thus prevent de- 
coupling of the two RNA strands, an event which is necessary for viral replication. Also, by 
binding to the quadruplex structure formed by the CDF, critical protein recognition and/or 
binding events necessary for nuclear transport of the PIC may be disrupted. In either case, a 
substantial advantage can exist over other anti-viral therapeutics. Current Highly Active Anti- 
Retroviral Therapeutic (HAART) regimes rely on the use of combinations of drugs targeted 
towards the HIV protease and HIV integrase. The requirement for multi-drug regimes is to 
minimize the emergence of resistance, which will usually develop rapidly when agents are used 
in isolation. The source of such rapid resistance is the infidelity of the reverse transcriptase 
enzyme which makes a mutation approx. once in every 10,000 base pairs. An advantage of 
targeting critical viral quadruplex structures over protein targets, is that the development of 
resistance is slow or is impossible. A point mutation of the target quadruplex, necessary to 
reduce affinity for the candidate molecule, can compromise the integrity of the critical 
quadruplex structure and lead to a non-functional copy of the virus. A single therapeutic agent 
based on this concept may replace the multiple drug regimes currently employed, with the 
concomitant benefits of reduced costs and the elimination of harmful drug/drug interactions. 

[0072J Thus, provided herein are methods for inhibiting viral propagation in a system, which 
comprise contacting a system having a viral quadruplex-forming nucleic acid sequence with a 
candidate molecule described herein. The system sometimes is a group of cells or one or more 
tissues, and often is a subject in need of a treatment of a viral infection (e.g., a mammal such as a 
mouse, rat, monkey, or human). In an embodiment, provided is a method for treating HIV 
infection by administering a candidate molecule identified herein to a subject in need thereof, 
thereby reducing the HIV titers in the systems and alleviating infection. 

[0073] Any suitable formulation of the candidate molecules described herein can be 
prepared for administration. Any suitable route of administration may be used, including but not 
limited to oral, parenteral, intravenous, intramuscular, topical and subcutaneous routes. 
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[0074] In cases where candidate molecules are sufficiently basic or acidic to form stable 
nontoxic acid or base salts, administration of the candidate molecules as salts may be 
appropriate. Examples of pharmaceutically acceptable salts are organic acid addition salts 
formed with acids that form a physiological acceptable anion, for example, tosylate, 
methanesulfonate, acetate, citrate, malonate, tartarate, succinate, benzoate, ascorbate, a- 
ketogiutarate, and a-glycerophosphate. Suitable inorganic salts may also be formed, including 
hydrochloride, sulfate, nitrate, bicarbonate, and carbonate salts. Pharmaceutically acceptable 
salts are obtained using standard procedures well known in the art, for example by reacting a 
sufficiently basic candidate molecule such as an amine with a suitable acid affording a 
physiologically acceptable anion. Alkali metal (e.g., sodium, potassium or lithium) or alkaline 
earth metal (e.g.^ calcium) salts of carboxylic acids also are made. 

[0075] In one embodiment, a candidate molecule is administered systemically (e.g., orally) 
in combination with a pharmaceutically acceptable vehicle such as an inert diluent or an 
assimilable edible carrier. They may be enclosed in hard or soft shell gelatin capsules, 
compressed into tablets, or incorporated directly with the food of the patient's diet. For oral 
therapeutic administration, the active candidate molecule may be combined with one or more 
excipients and used in the form of ingestible tablets, buccal tablets, troches, capsules, elixirs, 
suspensions, syrups, wafers, and the like. Such compositions and preparations should contain at 
least 0.1% of active candidate molecule. The percentage of the compositions and preparations 
may be varied and may conveniently be between about 2 to about 60% of the weight of a given 
unit dosage form. The amount of active candidate molecule in such therapeutically useful 
compositions is such that an effective dosage level will be obtained. 

[0076] Tablets, troches, pills, capsules, and the like also may contain the following: binders 
such as gum tragacanth, acacia, com starch or gelatin; excipients such as dicalcium phosphate; a 
disintegrating agent such as com starch, potato starch, alginic acid and the like; a lubricant such 
as magnesium stearate; and a sweetening agent such as sucrose, fructose, lactose or aspartame or 
a flavoring agent such as peppermint, oil of wintergreen, or cheny flavoring may be added. 
When the unit dosage form is a capsule, it may contain, in addition to materials of the above 
type, a liquid carrier, such as a vegetable oil or a polyethylene glycol. Various other materials 
may be present as coatings or to otherwise modify the physical form of the solid unit dosage 
fomi. For instance, tablets, pills, or capsules may be coated with gelatin, wax, shellac or sugar 
and the like. A syrup or elixir may contain the active candidate molecule, sucrose or fructose as 
a sweetening agent, methyl and propylparabens as preservatives, a dye and flavoring such as 
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cherry or orange flavor. Any material used in preparing any unit dosage form is 
pharmaceutically acceptable and substantially non-toxic in the amounts employed. In addition, 
the active candidate molecule may be incorporated into sustained-release preparations and 
devices. 

[0077] The active candidate molecule also may be administered intravenously or 
intraperitoneally by infusion or injection. Solutions of the active candidate molecule or its salts 
may be prepared in a buflFered solution, often phosphate buffered saline, optionally mixed with a 
nontoxic surfactant. Dispersions can also be prepared in glycerol, liquid polyethylene glycols, 
triacetin, and mixtures thereof and in oils. Under ordinary conditions of storage and use, these 
preparations contain a preservative to prevent the growth of microorganisms. The candidate 
molecule is sometimes prepared as a polymatrix-containing formulation for such administration 
(eg., a liposome or microsome). Liposomes are described for example in U.S. Patent 
No. 5,703,055 (Feigner, et al) and Gregoriadis, Liposome Technology vols. I to III 
(2nded. 1993). 

[0078] The pharmaceutical dosage forms suitable for injection or infusion can include sterile 
aqueous solutions or dispersions or sterile powders comprising the active ingredient that are 
adapted for the extemporaneous preparation of sterile injectable or infusible solutions or 
dispersions, optionally encapsulated in liposomes. In all cases, the ultimate dosage form should 
be sterile, fluid and stable under the conditions of manufacture and storage. The liquid carrier or 
vehicle can be a solvent or liquid dispersion medium comprising, for example, water, ethanol, a 
polyol (for example, glycerol, propylene glycol, liquid polyethylene glycols, and the like), 
vegetable oils, nontoxic glyceryl esters, and suitable mixtures thereof. The proper fluidity can 
be maintained, for example, by the formation of liposomes, by the maintenance of the required 
particle size in the case of dispersions or by the use of surfactants. The prevention of the action 
of microorganisms can be brought about by various antibacterial and antifungal agents, for 
example, parabens, chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many cases, 
it will be preferable to include isotonic agents, for example, sugars, buffers or sodium chloride. 
Prolonged absorption of the injectable compositions can be brought about by the use in the 
compositions of agents delaying absorption, for example, aluminum monostearate and gelatin. 

[0079] Sterile injectable solutions are prepared by incorporating the active candidate 
molecule in the required amount in the appropriate solvent with various of the other ingredients 
enumerated above, as required, followed by Alter sterilization. In the case of sterile powders for 
the preparation of sterile injectable solutions, the preferred methods of preparation are vacuum 
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drying and the freeze drying techniques, which yield a powder of the active ingredient plus any 
additional desired ingredient present in the previously sterile-filtered solutions. 

[0080] For topical administration, the present candidate molecules may be applied in liquid 
form. Candidate molecules often are administered as compositions or formulations, in 
combination with a dermatologicaily acceptable carrier, which may be a solid or a liquid. 
Examples of useful dermatological compositions used to deliver candidate molecules to the skin 
are known (see, e.g., Jacquet, et al (U.S. Pat. No. 4,608,392), Geria (U.S. Pat. No. 4,992,478), 
Smith, et al. (U.S. Pat. No. 4,559,157) and Wortzman (U.S. Pat. No. 4,820,508). 

[0081] Candidate molecules may be formulated with a solid carrier, which include fmely 
divided solids such as talc, clay, microcrystalline cellulose, silica, alumina and the like. Useful 
liquid carriers include water, alcohols or glycols or water-alcohol/glycol blends, in which the 
present candidate molecules can be dissolved or dispersed at effective levels, optionally with the 
aid of non-toxic surfactants. Adjuvants such as firagrances and additional antimicrobial agents 
can be added to optimize the properties for a given use. The resultant liquid compositions can 
be applied from absorbent pads, used to impregnate bandages and other dressings, or sprayed 
onto the affected area using pump-type or aerosol sprayers. Thickeners such as synthetic 
polymers, fatty acids, fatty acid salts and esters, fetty alcohols, modified celluloses or modified 
mineral materials can also be employed with liquid carriers to form spreadable pastes, gels, 
ointments, soaps, and the like, for application directly to the skin of the user. 

[0082] Generally, the concentration of the candidate molecule in a liquid composition often 
is from about 0.1 wt% to about 25 wt%, sometimes from about 0.5 wt% to about 10 wt%. The 
concentration in a semi-solid or solid composition such as a gel or a powder often is about 0.1 
wt% to about 5 wt%, sometimes about 0.5 wt% to about 2.5 wt%. A candidate molecule 
composition may be prepared as a unit dosage form, which is prepared according to 
conventional techniques known in the pharmaceutical industry. In general terms, such 
techniques include bringing a candidate molecule into association with pharmaceutical carrier(s) 
and/or excipient(s) in liquid form or finely divided solid form, or both, and then shaping the 
product if required. Hie candidate molecule composition may be formulated into any dosage 
form, such as tablets, capsules, gel capsules, liquid syrups, soft gels, suppositories, and enemas. 
Hie compositions also may be formulated as suspensions in aqueous, non-aqueous, or mixed 
media. Aqueous suspensions may ftirther contain substances which increase viscosity, including 
for example, sodium carboxymethylcellulose, sorbitol, and/or dextran. The suspension may also 
contain one or more stabilizers. 
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[0083] The amount of the candidate molecule, or an active salt or derivative thereof, 
required for use in treatment will vary not only with the particular salt selected but also with the 
route of administration, the nature of the condition being treated and the age and condition of the 
patient and will be ultimately at the discretion of the attendant physician or clinician. 

[0084] A useful candidate molecule dosage often is determined by assessing its in vitro 
activity in a cell or tissue system and/or in vivo activity in an animal system. For example, 
methods for extrapolating an effective dosage in mice and other animals to humans are known to 
the art (see, e.g., U.S. Pat. No. 4,938,949). Such systems can be used for determining the LD50 
(the dose lethal to 50% of the population) and the ED50 (the dose therapeutically effective in 
50% of the population) of a candidate molecule. The dose ratio between a toxic and therapeutic 
effect is the therapeutic index and it can be expressed as the ratio ED50/LD50. The candidate 
molecule dosage often lies within a range of circulating concentrations for which the ED50 is 
associated with little or no toxicity. The dosage may vary within this range depending upon the 
dosage form employed and the route of administration utilized. For any candidate molecules 
used in the methods described herein, the therapeutically effective dose can be estimated initially 
fix)m cell culture assays. A dose sometimes is formulated to achieve a circulating plasma 
concentration range covering the IC50 (i.e., the concentration of the test candidate molecule 
which achieves a half-maximal inhibition of symptoms) as determined in in vitro assays, as such 
information often is used to more accurately determine useful doses in humans. Levels in 
plasma may be measured, for example, by high performance liquid chromatography. 

[0085] Another example of effective dose determination for a subject is the ability to 
directly assay levels of 'Tree" and "bound" candidate molecule in the serum of the test subject. 
Such assays may utilize antibody mimics and/or "biosensors" generated by molecular imprinting 
techniques. The candidate molecule is used as a template, or "imprinting molecule", to spatially 
organize polymerizable monomers prior to their polymerization with catalytic reagents. 
Subsequent removal of the imprinted molecule leaves a polymer matrix which contains a 
repeated "negative image" of the candidate molecule and is able to selectively rebind the 
molecule under biological assay conditions (see, e.g., Ansell, et al^ Current Opinion in 
Biotechnology 7: 89-94 (1996) and in Shea, Trends in Polymer Science 2: 166-173 (1994)). 
Such "imprinted" affinity matrixes are amenable to ligand-binding assays, whereby the 
immobilized monoclonal antibody component is replaced by an appropriately imprinted matrix 
(see, e.g., Vlatakis, et al. Nature 361: 645-647 (1993)). Through the use of isotope-labeling, 
"free" concentration of candidate molecule can be readily monitored and used in calculations of 
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IC50. Such "imprinted" affinity matrixes can also be designed to include fluorescent groups 
whose photon-emitting properties measurably change upon local and selective binding of 
candidate molecule. These changes can be readily assayed in real time using appropriate fiber 
optic devices, in turn allowing the dose in a test subject to be quickly optimized based on its 
individual IC50. An example of such a "biosensor" is discussed in Kriz, et aL, Analytical 
Chemistry 67: 2142-2144 (1995). 

[0086] Exemplary doses include milligram or microgram amounts of the candidate molecule 
per kilogram of subject or sample weight, for example, about 1 microgram per kilogram to about 
500 milligrams per kilogram, about 100 micrograms per kilogram to about 5 milligrams per 
kilogram, or about 1 microgram per kilogram to about 50 micrograms per kilogram. It is 
understood that appropriate doses of a small molecule depend upon the potency of the small 
molecule with respect to the expression or activity to be modulated. When one or more of these 
small molecules is to be administered to an animal (e.g., a human) in order to modulate 
expression or activity of a polypeptide or nucleic acid described herein, a physician, 
veterinarian, or researcher may, for example, prescribe a relatively low dose at first, 
subsequently increasing the dose until an appropriate response is obtained. In addition, it is 
understood that the specific dose level for any particular animal subject will depend upon a 
variety of factors including the activity of the specific candidate molecule employed, tfie age, 
body weight, general health, gender, and diet of the subject, the time of administration, the route 
of administration, the rate of excretion, any drug combination, and the degree of expression or 
activity to be modulated. 

[0087] Each document and publication cited is incorporated herein by reference in its 
entirety, including all figures, drawings, tables, text, and documents and publications referenced 
therein. 
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What is claimed is: 

1 . A method for identifying a candidate molecule that modulates a biological 
activity of a nucleic acid capable of forming a G-quadruplex structure, which comprises: 

contacting a test molecule with the nucleic acid, wherein the nucleic acid or a portion 
thereof is in an intramolecular parallel G-quadruplex conformation; and 

determining whether a portion of the test molecule interacts with a site located in the 
intramolecular parallel G-quadruplex conformation, 

whereby a test molecule having a portion that interacts with the site is identified as a 
candidate molecule that modulates the biological activity of the nucleic acid. 

2. The method of claim 1, wherein the test molecule and the nucleic acid is 
contacted in silica. 

3. The method of claim 1, wherein the nucleic acid is double-stranded. 

4. The method of claim 1, wherein the nucleic acid is 30 or fewer nucleotides in 

length. 

5. The method of claim 1, wherein the nucleic acid comprises a nucleotide sequence 
located 5* of the CAm: gene. 

6. The method of claim S, wherein the nucleic acid comprises the nucleotide 
sequence TG4AG3TG4AG3TG4AAGG. 

7. The method of claim 6, wherein the site comprises one or more atoms in a G- 
tetrad of the G-quadruplex. 

8. The method of claim 1 , which further comprises determining whether a portion of 
the test molecule interacts with a site in a secondary structure adjacent to the G-quadruplex. 

9. The method of claim 8, wherein the secondary structure adjacent to the G- 
quadruplex is formed by a nucleotide sequence in a double stranded nucleic acid complementary 
to the nucleotide sequence that forms the G-quadruplex. 
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10. The method of claim 1 or 8, wherein a portion of the test molecule also 
intercalates with a duplex region adjacent to the G-quadruplex. 

1 1 . The method of claim I , wherein the interaction is a hydrogen bond. 

1 2. The method of claim 1 , which further comprises determining whether a candidate 
molecule modulates a biological activity of the nucleic acid. 

13. The method of claim 12, wherein the biological activity is an interaction of a 
protein with the nucleic acid. 

14. The method of claim 13, wherein the protein is NM23-H2. 

15. The method of claim 1 3, wherein the interaction is binding of the protein to the 
nucleic acid. 

16. The method of claim 12, wherein the biological activity is DNA transcription. 

1 7. The method of claim 12, wherein one or more nucleotides of the nucleic acid is 
substituted with a fluorescent nucleotide analog and the biological activity is determined by 
detecting the fluorescence of the nucleic acid. 

1 8. A method for identifying a therapeutic that reduces cell proliferation in a system, 
which comprises contacting a system with a candidate molecule identified by the method of 
claim 1 and determining whether the candidate molecule reduces cell proliferation in the system, 
whereby a candidate molecule that reduces cell proliferation in the system is identified as the 
therapeutic. 

19. The method of claim 1 8 wherein the system is a group of cells. 

20. The method of claim 1 8 wherein the system is an animal. 

21 . A method for stabilizing an intramolecular parallel G-quadruplex conformation of 
a nucleic acid, which comprises contacting the nucleic acid with a quadruplex-interacting 
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molecule in the system, whereby the molecule stabilizes the intramolecular parallel G- 
quadruplex conformation. 

22. The method of claim 21 , wherein the system is a group of cells. 

23. The method of claim 21 , wherein the system is an animal. 

24. A method for treating a cell proliferative condition in a subject, which comprises 
administering a candidate compound identified by the method of claim 1 or a therapeutic 
identified by the method of claim 18 to a subject in need thereof, whereby administering the 
candidate compound or the therapeutic reduces cell proliferation in the subject. 

25. The method of claim 24, wherein the cell proliferative condition is a cancer. 

26. The method of claim 25, wherein the cell proliferative condition is colorectal 

cancer. 

27. The method of claim 24, wherein the cell proliferative condition is angiogenesis. 
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Abstract of the Disclosure 

Featured herein is a method for identifying candidate molecules that interact with a 
nucleic acid having an intramolecular parallel quadruplex structure. The candidate molecules 
often are screened in vitro to determine whether they modulate the biological activity of the 
nucleic acid and sometimes are administered to a subject in vivo to treat a cell proliferative 
condition. Also provided are methods for stabilizing an intramolecular parallel G-quadruplex 
conformation of a nucleic acid by contacting the nucleic acid with a quadruplex-interacting 
molecule. 
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